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METHODS OF CONSTRUCTING AND SCREENING A LIBRARY OF VARI AOTS 
OF A POLYNUCLEOTIDE SEQUENCE IN FILAMEOTOUS FUNGAL CELLS 

5 Background of the Invention 

Field of the Invention 

The present invention relates to methods of constructing and screening a library of 
variants of a polynucleotide sequence in filamentous fungal cells. 

10 

Description of the Related Art 

Filamentous fungi have been widely used as host cells for the commercial production of 
polypeptides. However, when it is desirable to produce a variant of the polypeptide with 
specified altered characteristics, e.g., thermostability, pH activity profile, specific activity, 

15 substrate specificity, K^, etc., the construction and screening of a library of variant en- 
coding sequences commonly requires the use of an intermediate host, e.g., bacterial cells or 
yeast, due to the low frequency of transformation and a variation in copy number among 
independently transformed filamentous fungal cells. 

Several methods for the construction of libraries of variants of a polynucleotide se- 

20 quence of interest in yeast have been disclosed in which the libraries are screened in yeast 
prior to transformation of a production relevant host, such as, for example, filamentous 
fungi with the potential variant polynucleotide sequences of interest. 

Often however, a polynucleotide sequence identified by screening in yeast or bacteria 
cannot be expressed or is expressed at low levels when transformed into production relevant 

25 filamentous fungal cells. This may be due any number of reasons, including differences in 
codon usage, regulation of mRNA levels, translocation apparatus, post-translational modifi- 
cation machinery (e.g., cysteine bridges, glycosylation and acylation patterns), etc. 

Secondly, whether a polynucleotide sequence of interest would be expressed in the pro- 
duction host at commercially useful levels is not necessarily predictable. For example, if 

30 the organism used in screening the library is, e.g., a bacterial or yeast cell and the produc- 
tion relevant host cell is a filamentous fungal cell, the protease profiles differ. Thus, a se- 
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quence encoding one or more characteristics of interest which has been identified in yeast 
may be degraded by proteases expressed in the product relevant filamentous fungal host 
cell. Furthermore, to obtain optimized yields of the expressed product by altering the 
function of regulatory proteins or regulatory sequences requires direct manipulation of the 
5 production host. 

A. Aleksenko and A. J. Clutterbuck (1997. Fungal Genetics and Biology 21:373-387) 
disclose the use of autonomous replicative vectors, or autonomously replicating sequences 
(ARS), for gene cloning and expression studies. AMAl (autonomous maintenance in As- 
pergillus) is one of the plasmid replicator elements discussed. It consists of two inverted 
10 copies of a genomic repeat designated MATEl (mobile Aspergillus transformation enhan- 
cer) separated by a 0.3 kb central spacer. AMAl promotes plasmid replication without re- 
arrangement, multimerization or chromosomal integration. 

15 Summary of the Invention 

It has been found that AMAl -based plasmids provide two advantages in gene cloning in 
filamentous fungi. The first is a high frequency of transformation which both increases the 
potential library size and can eliminate the need for library amplification in an intermediate 

20 host, e.g., E. coli, so that a recipient Aspergillus strain can be transformed directly with a 
ligation mixture. Secondly, by providing a stable and standard environment for gene ex- 
pression, the properties of the transformants will be uniform. 

It is an objective of the present invention to provide improved methods for constructing 
and screening libraries of mutated DNA sequences of interest in filamentous fungal cells by 

25 use of an episomal replicating plasmid to provide a high frequency of transformation and a 
uniformly high level of gene expression among independently transformed cells. By mini- 
mizing variation in copy number among independently transformed cells, a variant 
polypeptide of interest may be identified directly on the basis of expression of the charac- 
teristic(s) of interest. 
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Accordingly, in a first aspect of the present invention relates to a method of 
constructing and selecting or screening a library of variants of a polynucleotide sequence of 
interest in filamentous fungal cells wherein the method comprises: 

(a) transforming the fungal cells with a population of DNA vectors, wherein each 
5 vector comprises: 

(i) a polynucleotide sequence encoding a fungal selective marker and a fungal 
replication initiating sequence, wherein the marker and the replication initiat- 
ing sequence do not vary within the population; and 

(ii) a polynucleotide sequence of interest wherein the population of DNA vectors 
10 contains more than one variant of the polynucleotide sequence; 

(b) cultivating the cells under selective pressure; 

(c) selecting or screening for one or more transformants expressing a desired charac- 
teristic encoded in the variant polynucleotide sequence of (ii) above; and 

(d) isolating the transformant(s) of interest. 

15 In other aspects, the invention relates to the use of a fungal replication intiating 

sequence to construct a library of variants of a polynucleotide sequence of and to a library 
of such variant polynucleotide sequences. 



Brief Description of the Figures 

20 

Figure 1 shows a restriction map of the plasmid pENI1298, the construction of which 
is described in Example 1. 

Figure 2 shows a restriction map of the plasmid pENI1299. the construction of which 
is described in Example 1. 

25 

[List of SEQ ID Nos: 

SEQIDNO:! The fungal replication initiating sequence from A. nidulans (AMAl 
sequence) 

SEQ ID NO : 2 truncated AM A 1 

30 

DetaUed Description of the Invention 
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In a first embodiment, the present invention relates to a method of constructing and se- 
lecting or screening a library of variants of a polynucleotide sequence of interest in fila- 
mentous fungal cells, wherein the method comprises: 
5 (a) transforming the fungal cells with a population of DNA vectors, wherein each 

vector comprises: (i) a polynucleotide sequence encoding a fungal selective marker 
and a fungal replication initiating sequence, wherein the marker and the replication 
initiating sequence do not vary within the population; and (ii) a polynucleotide se- 
quence of interest, wherein the population of DNA vectors contains more than one 
10 variant of the polynucleotide sequence; 

(b) cultivating the cells under selective pressure; 

(c) selecting or screening for one or more transformants expressing a desired charac- 
teristic encoded in the variant polynucleotide sequence of (ii) above; and 

(d) isolating the transformant(s) of interest. 

15 The term, "a library of variants of a polynucleotide sequence of interest," is defined 

herein as a collection of variants wherein the variants differ from the parent polynucleotide 
sequence(s) by comprising one or more modifications of the parent polynucleotide se- 
quence. Conveniently, the variants are generated by mutagenesis, preferably random 
mutagenesis of at least one parent polynucleotide sequence of interest, resulting in a mini- 

20 mum number of four variants in the collection. The variants may also result from naturally 
occurring allelic variation of the parent polynucleotide sequence. An allelic variant denotes 
any of two or more alternative forms of a gene occupying the same chromosomal locus. 
Allelic variation arises naturally through mutation, and may result in phenotypic polymor- 
phism within populations. Gene mutations can be silent (i.e., no change in the encoded 

25 polypeptide) or may encode polypeptides having altered amino acid sequences. The parent 
sequence(s) may be any polynucleotide sequence having or encoding a biological function 
of interest and comprising a subsequence of at least six nucleotides which may be modified 
to produce the variant. 

In a preferred embodiment the parent polynucleotide sequence is a polypeptide encod- 

30 ing sequence. The term "polypeptide" encompasses peptides, oligopeptides, and proteins and, 
therefore, is not limited to a specific length of the encoded product. The parent polypeptide to 
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be used according to the method of the invention may be native to the host cell or may be a 
heterologous polypeptide. The term "heterologous polypeptide" is defined as a polypeptide 
which is not native to the host cell. The polypeptide may also be a recombinant polypeptide 
which is a polypeptide native to a cell, which is encoded by a nucleic acid sequence which 
5 comprises one or more control sequences, foreign to the nucleic acid sequence, which are in- 
volved in the production of the polypeptide. The nucleic acid sequence encoding the 
polypeptide may have been manipulated in some manner as described infra. The polypeptide 
may be a wild-type polypeptide, or a variant thereof, subjected to the method of the present 
invention. The polypeptide to be used in the method of the present invention may also be a 

10 hybrid polypeptide which contains a combination of partial or complete polypeptide se- 
quences obtained from at least two different polypeptides where one or more of the polypep- 
tides may be heterologous to the cell. Polypeptides further include naturally occurring allelic 
and engineered variations of the above mentioned polypeptides. 

In another preferred embodiment the parent nucleotide sequence is a control sequence 

15 normally associated with a polypeptide encoding sequence. Control sequences include all 
components which are operably linked to the nucleic acid sequence encoding the polypeptide 
sequence or otherwise involved in the production of the polypeptide. Such control sequences 
include, but are not limited to, a promoter, a signal sequence, a propeptide sequence, a tran- 
scription terminator, a leader, a promoter recognition sequence, an enhancer sequence and a 

20 polyadenylation sequence as described herein. In a still further embodiment the polynucleo- 
tide sequence of interest is a combination of a polypeptide encoding sequence (or a part of 
such sequence) and a) a control sequence (or part of such control sequence) or b) two or more 
control sequences (or parts of such sequences). Each of the control sequences may be native 
or foreign to the coding sequence. 

25 In a preferred embodiment, the polypeptide is an antibody or portions thereof, an antigen, 

a clotting factor, an enzyme, a hormone or a hormone variant, a receptor or portions thereof, a 
regulatory protein, a structural protein, a reporter, or a transport protein. 

In a more preferred embodiment, the enzyme is an oxidoreductase, transferase, hydrolase, 
lyase, isomerase, or ligase. 

30 In an even more preferred embodiment, the enzyme is an aminopeptidase, amylase, car- 

bohydrase, carboxypeptidase, catalase, cellulase, chitinase, cutinase, deoxyribonuclease, dex- 
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tranase, esterase, alpha-galactosidase, beta-galactosidase, glucoamylase, alpha-glucosidase, 
beta-glucosidase, haloperoxidase, invertase, laccase, lipase, mannosidase, mutanase, oxidase, 
pectinolytic enzyme, peroxidase, phytase, polyphenoloxidase, proteolytic enzyme, ribonucle- 
ase, transglutaminase, or xylanase. A specific example of a lipase is a lipase derived from 
5 Thermomyces lanuginosa or a variant of said lipase, eg a variant in which an N-terminal ex- 
tension has been added to the mature lipase enzyme as disclosed in WO 97/04079. 

In another embodiment, the polypeptide is human insulin or an analog thereof, human 
growth hormone, erythropoietin, or insulinotropin. 

In a further preferred embodiment the control sequence is a promoter sequence, prefera- 
10 bly a fungal promoter, such as a promoter derived from the gene encoding Aspergillus oryzae 
TAKA amylase, NA2-tpi (a hybrid of the promoters from the genes encoding A. niger neu- 
tral a-amylase and A, oryzae triose phosphte isomerase) and Aspergillus niger or Asper- 
gillus awamori glucoamylase. 
In another preferred embodiment the control sequence is a promoter recognition sequence 
15 such as the amyR recognition sequence (WO 98/01470), the creA (WO 94/13820) and areA 
(WO 95/35385). 

Examples of further control sequences of interest in connection with the present inven- 
tion is listed further below in the section entitled "DNA vectors and control sequences". 

20 Filamentous fungal s elective marker 

The term "selective pressure" is defmed herein as culturing a filamentous fiingal cell, 

containing a DNA vector containing a fungal selective marker gene operably linked to a 

polynucleotide sequence of interest, in the presence of an effective amount or the absence of 

an appropriate selective agent. The effective amount of the selective agent is defined herein 
25 as an amount sufficient for allowing the selection of cells containing the selection marker 

from cells which do not contain the selection marker. 

In a preferred embodiment, the fungal selective marker is selected from the group of 

genes which encode a product capable of providing resistance to biocide or viral toxicity, 

resistance to heavy metal toxicity, or prototrophy to auxotrophs. 
30 In a more preferred embodiment, the prototrophy is obtained from an enzyme selected 

from the group of metabolic pathways consisting of nucleotide synthesis, cofactor synthesis. 
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amino acid synthesis, acetamide metabolism, proline metabolism, sulfate metabolism, and 
nitrate metabolism. 

In an even more preferred embodiment, in the methods of the present invention the 
fungal selective marker is a gene selected from the group consisting of argB (ornithine car- 
5 bamoyltransferase), amdS (acetamidase), bar (phosphinothricin acetyltransferase), hemA (5- 
aminolevulinate synthase), hemB (porphobilinogen synthase), hygB (hygromycin phos- 
photransferase), niaD (nitrate reductase), pm (proline permease), pyrG (orotidine-5'- 
phosphate decarboxylase), pyroA, riboB, sC (sulfate adenyltransferase), and trpC 
(anthranilate synthase). 

10 The fungal cell is cultivated in a suitable medium and under suitable conditions for 

screening or selecting for transformants harbouring the variant polynucleotide sequence of 
interest having or encoding the desired characteristic. The cultivation may be performed in 
accordance with methods well-known in the art for screening of polynucleotide variant li- 
braries. 

15 

Replication initiating sequences 

As used herein, the term "fungal replication initiating sequence" is defmed as a 
nucleic acid sequence which is capable of supporting autonomous replication of an ex- 
trachromosomal molecule, e.g., a plasmid or a DNA vector, in a fungal host cell, normally 

20 without structural rearrangement of the plasmid or integration into the host cell genome. 
The replication initiating sequence may be of any origin as long as it is capable of mediat- 
ing replication intiating activity in a fungal cell. Preferably, the replication initiating se- 
quence is obtained from a filamentous fungal cell, more preferably a strain of Aspergillus, 
Fusarium or Altemaria^ and even more preferably, a strain of A. nidulans, A. oryzae. A, 

25 niger, F, oxysporum or Altemaria altemta. 

A replication initiating sequence may be identified by methods well-known in the 
art. For instance, the sequence may be identified among genomic fragments derived from 
the organism in question as a sequence capable of sustaining autonomous replication in 
yeast, (Ballance and Turner, Gene, 36 (1985), 321-331), an indication of a capability of 

30 autonomous replication in filamentous fungal cells. The replication initiating activity in 
fungi of a given sequence may also be determined by transforming fiingi with contemplated 
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plasmid replicators and selecting for colonies having an irregular morphology, indicating 
loss of a sectorial plasmid which in turn would lead to lack of growth on selective medium 
when selecting for a gene found on the plasmid (Gems et al, Gene, 98 (1991) 61-67). 
AMAl was isolated in this way. An alternative way to isolate a replication initiating se- 
5 quence is to isolate natural occurring plasmids (eg as disclosed by Tsuge et al.. Genetics 
146 (1997) 1 1 1-120 for Altemaria atemata). 

Examples of replication initiating sequences include, but are not limited to, the 
ANSI and AMAl sequences of Aspergillus nidulans, e.g., as described, respectively, by 
Cullen, D., et al. (1987, Nucleic Acids Res, 15:9163-9175) and Gems. D., et al. (1991. 
10 Gene 98:61-67). 

The term "replication initiating activity" is used herein in its conventional mean- 
ing, ie to indicate that the sequence is capable of supporting autonomous replication of an 
extrachromosomal molecule, such as a plasmid or a DNA vector in a fungal cell. 

The term "without structural rearrangement of the plasmid" is used herein to mean that 
15 no part of the plasmid is deleted or inserted into another part of the plasmid, nor is any host 
genomic DNA inserted into the plasmid. 

Preferably, the replication initiating sequence to be used in the methods of the present 
invention is a nucleic acid sequence selected from the group consisting of: 

(a) a nucleotide sequence having at least 50% identity with the nucleic acid sequence of 
20 SEQ ID NO: 1 or SEQ ID NO:2, and is capable of initiating replication in a fungal cell; 

(b) a nucleotide sequence capable of initiating replication which hybridises under low 
stringency conditions with (i) the nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO:2, or 
(ii) the respective complementary strands, wherein the low stringency conditions are defined 
by prehybridisation and hybridisation at ATC in 5x SSPE, 0.3% SDS, 200 ng/ml sheared and 

25 denatured salmon sperm DNA, and 25% formamide, and wash conditions are defined at 50°C 
for 30 minutes in 2x SSC, 0.2% SDS; and 

(c) a subsequence of (a) or (b), wherein the subsequence is capable of initiating replica- 
tion in a fungal cell.. 

In a preferred embodiment, the nucleotide sequence has a degree of identity to the 
30 nucleic acid sequence shown in SEQ ID NO:l or SEQ ID NO: 2 of at least about 50%, 
more preferably about 60%, even more preferably about 70%, even more preferably about 
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80%, even more preferably about 90%. and most preferably about 97% identity 
(hereinafter "homologous polynucleotide"). The homologous polynucleotide also 
encompasses a subsequence of SEQ ID NO:l or SEQ ID NO:2 which have replication 
initiating activity in fungal cells. For purposes of the present invention, the degree of 
5 identity may be suitably determined by means of computer programs known in the art, such as 
GAP provided in the GCG program package (Program Manual for the Wisconsin Package, 
Version 8, August 1994, Genetics Computer Group, 575 Science Drive, Madison, Wisconsin, 
USA 53711) (Needleman, S.B. and Wunsch, CD., (1970), Journal of Molecular Biology, 48, 
443-45), using GAP with the following settings for polynucleotide sequence comparison: 

10 GAP creation penalty of 5.0 and GAP extension penalty of 0.3. 

Hybridisation indicates that by methods of standard Southern blotting procedures, the 
replication initiating sequence hybridises to an oligonucleotide probe derived from the 
nucleic acid sequence shown in SEQ ID N0:1 or SEQ ID NO: 2, under low to high 
stringency conditions (i.e., prehybridisation and hybridisation at 42*'C in 5x SSPE, 0.3% 

15 SDS, 200 |ig/ml sheared and denatured salmon sperm DNA, and either 25, 35 or 50% 
formamide for low, medium and high stringencies, respectively). In order to identify a 
clone or DNA which is homologous with SEQ ID NO:l or SEQ ID NO:2, the hybridisation 
reaction is washed three times for 30 minutes each using 2X SSC, 0.2% SDS preferably at 
least 50°C, more preferably at least SS^C, more preferably at least 60°C, more preferably 

20 at least 65**C, even more preferably at least 70*'C, and most preferably at least 75**C. 

The oligonucleotide probe may be the nucleic acid sequence of SEQ ID NO:l or SEQ 
ID NO:2, or a respective subsequence thereof. More specifically, the oligonucleotide probe 
can be considerably shorter than the entire sequence, but should be at least 15, preferably at 
least 25, and more preferably at least 40 nucleotides in length. Both DNA and RNA probes 

25 can be used. 

The probes are typically labelled for detecting the corresponding gene (for example, 
with ^^P, ^H, ^^S, biotin, or avidin). For example, molecules to which a "P-, ^H- or "S- 
labelled oligonucleotide probe hybridises may be detected by use of X-ray film. 

When isolating a replication initiating sequence for use in the present invention, a 
30 genomic DNA, cDNA or combinatorial chemical library prepared from such an organism 
as defined above expected to harbour the sequence is screened for DNA which hybridises 
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with the oligonucleotide probe described above which has replication initating activity. 
Genomic or other DNA from such other organisms may be separated by agarose or 
polyacrylamide gel electrophoresis, or other separation techniques. DNA from the libraries 
or the separated DNA may be transferred to and inrmiobilised on nitrocellulose or other 
5 suitable carrier material. A clone or DNA which is homologous to SEQ ID NO:l or SEQ 
ID NO: 2 may then be identified following standard Southern blotting procedures. 

The techniques used to isolate or clone a nucleic acid sequence having replication initiating 
activity are known in the art and include isolation from genomic DNA or cDNA, The 
cloning from such DNA can be effected, e,g,, by using methods based on polymerase chain 
10 reaction (PGR) to detect cloned DNA fragments with shared structural features. (See, e.g., 
Innis, et aL, 1990, PCR: A Guide to Methods and Application, Academic Press, New 
York.) Other nucleic acid amplification procedures such as ligase chain reaction (LCR) 
may be used. 

In preferred embodiment, the replication initiating sequence has the nucleic acid 
15 sequence set forth in SEQ ID NO: I or SEQ ID NO:2, or a respective functional 
subsequence thereof. For instance, a functional subsequence of SEQ ID NO:l is a nucleic 
acid sequence encompassed by SEQ ID NO:l or SEQ ID NO 2 except that one or more 
nucleotides from the 5' and/or 3' end have been deleted. Preferably, a subsequence contains at 
least 100 nucleotides, more preferably at least 1000 nucleotides, and most preferably at least 
20 2000 nucleotides. In a more preferred embodiment, a subsequence of SEQ ID NO: 1 contains 
at least the nucleic acid sequence shown in SEQ ID NO:2. 

DNA ygctor and control seqyenccs 

In the DNA vector comprising a polynucleotide sequence encoding a fungal selective 

25 marker, a fungal replication initiating sequence and a polynucleotide sequence of interest 
the polynucleotide sequence may encode a polypeptide in which case it is operably linked to 
one or more control sequences which direct the expression of the coding sequence. Alter- 
natively, the polynucleotide sequence is a control sequence in which case, depending on the 
control sequence in question, it is normally operably linked to a polypeptide encoding se- 

30 quence in order to be able to assess the activity of the control sequence (and thus be able to 
select variants of the parent control sequence having desired properties). 
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The procedures used to ligate the elements of the DNA vector are well known to one 
skilled in the art (see, e.g, , Sambrook et ai , 1989, supra). 

The term "operably linked" is defined herein as a configuration in which a control se- 
quence is appropriately placed at a position relative to the polypeptide encoding sequence 
5 such that the control sequence directs the expression of the polypeptide or is otherwise in- 
volved in the production of the polypeptide. 

In the following different control sequences are discussed in further detail. The control 
sequences are those which are operably linked to the variants of the polynucleotide sequence 
of interest (when the polynucleotide sequence of interest encodes a polypeptide), and those 
10 which constitute the parent polynucleotide sequence of interest (when this parent polynucleo- 
tide sequence is a control sequence). It will be understood that one and the same control se- 
quence may be used either as a parent polynucleotide sequence of interest (when the library is 
a library of variant control sequences) or as a control sequence involved in the production of a 
polypeptide encoded by a polynucleotide sequence of interest (when the library is a library of 
15 variant polypeptide encoding sequences) and the following disclosure is intended to cover 
both types of use of the control sequence. 

The control sequence may be an appropriate promoter sequence, a nucleic acid sequence 
which is recognized by a host cell for expression of a polypeptide encoding sequence. The 
promoter sequence contains transcriptional control sequences which mediate the expression of 
20 the polypeptide. The promoter may be any nucleic acid sequence which shows transcriptional 
activity in the host cell of choice including mutant, truncated, and hybrid promoters, and may 
be obtained fi*om genes encoding extracellular or intracellular polypeptides either homologous 
or heterologous to the host cell. 

Examples of suitable promoters for directing the transcription of a polypeptide encoding 
25 nucleotide sequence in a filamentous fungal host cell are promoters obtained from the genes 
encoding Aspergillus oryzae TAKA amylase, Rhizomucor ntiehei aspartic proteinase, Asper- 
gillus niger neutral alpha-amylase, Aspergillus niger acid stable alpha-amylase, Aspergillus 
niger or Aspergillus awamori glucoamylase (glaAX Rhizomucor miehei lipase, Aspergillus 
oryzae alkaline protease, Aspergillus oryzae triose phosphate isomerase, Aspergillus nidulans 
30 acetamidase, Fusarium oxysporum trypsin-like protease (U.S. Patent No. 4,288,627), and 
mutant, truncated, and hybrid promoters thereof. Particularly preferred promoters for use in 
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filamentous fungal host cells are the TAKA amylase, NA2-tpi (a hybrid of the promoters from 
the genes encoding Aspergillus niger neutral alpha-amylase and Aspergillus oryzae triose 
phosphate isomerase), and glaA promoters. 

The control sequence may also be a suitable transcription terminator sequence, a se- 
5 quence recognized by a filamentous fungal cell to terminate transcription. The terminator se- 
quence is operably linked to the 3' temiinus of the polypeptide encoding nucleotide sequence. 
Any terminator which is functional in the filamentous fungal cell may be used in the present 
invention. 

Preferred terminators are obtained fi-om the genes encoding Aspergillus oryzae TAKA 
10 amylase, Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate synthase, Asper- 
gillus niger alpha-glucosidase, and Fusarium oxysporum trypsin-like protease. 

The control sequence may also be a suitable leader sequence, a nontranslated region of a 
mRNA which is important for translation by the filamentous fungal cell. The leader sequence 
is operably linked to the 5' terminus of the polypeptide encoding nucleotide sequence. Any 
15 leader sequence which is functional in the filamentous fungal cell may be used in the present 
invention. 

Preferred leaders are obtained from the genes encoding Aspergillus oryzae TAKA amy- 
lase and Aspergillus nidulans triose phosphate isomerase. 

The control sequence may also be a polyadenylation sequence, a sequence which is op- 
20 erably linked to the 3' terminus of the polypeptide encoding nucleotide sequence which, when 
transcribed, is recognized by a filamentous fungal cell as a signal to add polyadenosine resi- 
dues to transcribed mRNA. Any polyadenylation sequence which is functional in the fila- 
mentous fungal cell may be used in the present invention. 

Preferred polyadenylation sequences are obtained from the genes encoding Aspergillus 
25 oryzae TAKA amylase, Aspergillus niger glucoamylase, Aspergillus nidulans anthranilate 
synthase, and Aspergillus niger alpha-glucosidase. 

The control sequence may also be a signal peptide coding region, which codes for an 
amino acid sequence linked to the amino terminus of the polypeptide which can direct the en- 
coded polypeptide into the cell's secretory pathway. The 5' end of the polypeptide encoding 
30 nucleotide sequence may inherently contain a signal peptide coding region naturally linked in 
translation reading frame with the segment of the coding region which encodes the secreted 
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polypeptide. Alternatively, the 5' end of the coding sequence may contain a signal peptide 
coding region which is foreign to the coding sequence. The foreign signal peptide coding re- 
gion may be required where the coding sequence does not normally contain a signal peptide 
coding region. Alternatively, the foreign signal peptide coding region may simply replace the 
5 natural signal peptide coding region in order to obtain enhanced secretion of the polypeptide. 
The signal peptide coding region may be obtained from a glucoamylase or an amylase gene 
from an Aspergillus species, or a lipase or proteinase gene from a Rhizomucor species. How- 
ever, any signal peptide coding region which directs the expressed polypeptide into the se- 
cretory pathway of a filamentous fiingal cell may be used in the present invention. 

10 An effective signal peptide coding region is the signal peptide coding region obtained 

from the Aspergillus oryzae TAKA amylase gene, Aspergillus niger neutral amylase gene, 
Rhizomucor miehei aspartic proteinase gene, or Humicola lanuginosa cellulase gene. 

The control sequence may also be a propeptide coding region, which codes for an amino 
acid sequence positioned at the amino terminus of a polypeptide. The resultant polypeptide is 

15 known as a proenzyme or propolypeptide (or a zymogen in some cases). A propolypeptide is 
generally inactive and can be converted to a mature, active polypeptide by catalytic or auto- 
catalytic cleavage of the propeptide from the propolypeptide. The propeptide coding region 
may be obtained from the Rhizomucor miehei aspartic proteinase gene, or the Myceliophthora 
thermophila laccase gene (WO 95/33836). 

20 Where both signal peptide and propeptide regions are present at the amino terminus of a 

polypeptide, the propeptide region is positioned next to the amino terminus of a polypeptide 
and the signal peptide region is positioned next to the amino terminus of the propeptide re- 
gion. 

The polypeptide encoding nucleotide sequence may also be operably linked to one or 
25 more nucleic acid sequences which encode one or more factors that are advantageous for di- 
recting the expression of the polypeptide, e.^., a transcriptional activator (eg., a frawj-acting 
factor), a chaperone, and a processing protease. Any factor that is functional in a filamentous 
fimgal cell may be used in the present invention. The nucleic acids encoding one or more of 
these factors are not necessarily in tandem with the polypeptide encoding nucleotide se- 
30 quence. 
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An activator is a protein which activates transcription of a polypeptide encoding nucleo- 
tide sequence (Kudla et aL, 1990, EMBO Journal 9: 1355-1364; Jarai and Buxton, 1994, Cur- 
rent Genetics 26: 2238-244; Verdier, 1990, Yeast 6: 271-297). The nucleic acid sequence en- 
coding an activator may be obtained from the genes encoding Saccharomyces cerevisiae heme 
5 activator protein 1 (hapl), Saccharomyces cerevisiae galactose metabolizing protein 4 (gal4), 
Aspergillus nidulans ammonia regulation protein (areA\ and Aspergillus oryzae alpha- 
amylase activator (amyR). For further examples, see Verdier, 1990, supra and MacKenzie et 
al, \993, Journal of General Microbiology 139: 2295-2307. 

A chaperone is a protein which assists another polypeptide to fold properly (Hartl et aL^ 

10 1994, TIBS 19: 20-25; Bergeron et al, 1994, TIBS 19: 124-128; Demolder et ai, 1994, Jour- 
nal of Biotechnology 32: 179-189; Craig, 1993, Science 260: 1902-1903; Gething and Sam- 
brook, 1992, Nature 355: 33-45; Puig and Gilbert, 1994, Journal of Biological Chemistry 
269: 116A-111\\ Wang and Tsou, 1993, The FASEB Journal 7: 1515-1 1 157; Robinson et al, 
1994, Bio/Technology 1: 381-384; Jacobs et al, 1993, Molecular Microbiology 8: 957-966). 

15 The nucleic acid sequence encoding a chaperone may be obtained from the genes encoding 
Aspergillus oryzae protein disulphide isomerase or Saccharomyces cerevisiae calnexin, Sac- 
charomyces cerevisiae BiP/GRP78, and Saccharomyces cerevisiae Hsp70. For further exam- 
ples, see Gething and Sambrook, 1992, supra, and Hartl et al, 1994, supra, 

A processing protease is a protease that cleaves a propeptide to generate a mature bio- 

20 chemically active polypeptide (Enderlin and Ogrydziak, 1994, Yeast 10: 67-79; Fuller et al, 
1989, Proceedings of the National Academy of Sciences USA 86: 1434-1438; Julius et aL, 
1984, Cell 37: 1075-1089; Julius et al, 1983, Cell 32: 839-852; U.S. Patent No. 5,702,934). 
The nucleic acid sequence encoding a processing protease may be obtained from the genes 
encoding Saccharomyces cerevisiae dipeptidylaminopeptidase, Saccharomyces cerevisiae 

25 Kex2, Yarrowia lipolytica dibasic processing endoprotease (xpr6), and Fusarium oxysporum 
metalloprotease (p45 gene). 

It may also be desirable to add regulatory sequences which allow the regulation of the 
expression of the polypeptide relative to the growth of the filamentous fungal cell. Examples 
of regulatory systems are those which cause the expression of the gene to be turned on or off 

30 in response to a chemical or physical stimulus, including the presence of a regulatory com- 
poxmd. The TAKA alpha-amylase promoter, Aspergillus niger glucoamylase promoter, and 
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the Aspergillus oryzae glucoamylase promoter may be used as regulatory sequences. Other 
examples of regulatory sequences are those which allow for gene amplification, e.g., the met- 
allothionein genes which are amplified with heavy metals. In these cases, the polypeptide en- 
coding nucleotide sequence would be operably linked with the regulatory sequence. 
5 The introduction of the DNA vector into a filamentous fungal cell may involve a process 

consisting of protoplast formation, transformation of the protoplasts, and regeneration of the 
cell wall in a manner known per se. Suitable procedures for transformation of Aspergillus 
host cells are described in EP 238 023 and Yelton et al.y 1984, Proceedings of the National 
Academy of Sciences USA 81: 1470-1474. A suitable method of transforming Fusarium spe- 
10 cies is described by Malardier et al, 1989, Gene 78: 147-156 or in WO 96/00787. 

Fungal Cells 

The fungal cells to be transformed with the population of DNA vectors are filamentous 
fungal cells. 

15 "Filamentous fungi" include all filamentous forms of the subdivision Eumycota and Oo- 

mycota (as defined by Hawksworth et al., 1995, supra). The filamentous fungi are character- 
ized by a mycelial wall composed of chitin, cellulose, glucan, chitosan, mannan, and other 
complex polysaccharides. Vegetative growth is by hyphal elongation and carbon catabolism 
is obligately aerobic. In contrast, vegetative growth by yeasts such as Saccharomyces cere- 

20 visiae is by budding of a unicellular thallus and carbon catabolism may be fermentative. In a 
preferred embodiment, the filamentous fungal cell is a cell of a species of, but is not limited 
to, Acremonium, Aspergillus, Fusarium^ Humicola, Mucor, Myceliophthora, Neurospora^ 
Penicillium, Scytalidium, Thielavia, Tolypocladium, and Trichoderma, 

The filamentous fungal cell to be used in the present invention is normally chosen on the 

25 basis of the polynucleotide sequence of interest. For instance, if the polynucleotide of interest 
is a control sequence intended for use in an Aspergillus cell the filamentous fungal cell is 
normally an Aspergillus cell. Examples of filamentous fungal cells of use in the present in- 
vention include an Aspergillus cell, an Acremonium cell, a Fusarium cell, a Humicola cell, a 
Mucor cell, a Myceliophthora cell, a Neurospora cell, a Penicillium cell, a Thielavia cell, a 

30 Tolypocladium cell, and a Trichoderma cell. 
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More specifically, the filamentous fiingal cell is an Aspergillus awamori, Asper- 

gillus foetidus^ Aspergillus japonicus^ Aspergillus nidulans^ Aspergillus niger^ or Aspergillus 
oryzae cell; 

a Fusarium bactridioides^ Fusarium cerealis, Fusarium crookwellense^ Fusarium culmorum, 
5 Fusarium graminearum^ Fusarium graminum, Fusarium heterosporumy Fusarium negundiy 
Fusarium oxysporum^ Fusarium reticulatum^ Fusarium roseum, Fusarium sambucinum^ 
Fusarium sarcochroum^ Fusarium sporotricioideSy Fusarium sulphureum^ Fusarium torulo- 
sum, Fusarium trichothecioides^ Fusarium venenatum cell or a Fusarium venenatum cell 
(Nirenberg sp. nov; a Humicola insolens cell or a Humicola lanuginosa cell; a Mucor miehei 
10 cell; a Myceliophthora thermophila cell; a Neurospora crassa cell; a Penicillium purpuro- 
genum cell; a Thielavia terrestris cell; or a Trichoderma harzianum, Trichoderma koningiiy 
Trichoderma longibrachiatumj Trichoderma reesei, or Trichoderma viride cell. 

Selecting or screening the library for transformants of interest 

15 It will be understood that the method to be used for performing the selection or 

screening step c) of the method of the invention will depend on the polynucleotide variant in 
question. If the polynucleotide variant encodes a polypeptide with a certain function the 
screening will be performed to select transformants expressing a polypeptide with the 
desired property. For instance, if the polynucleotide sequence encodes an enzyme, such as a 

20 lipase, the selection or screening step c) will be performed to select transformants 
expressing lipase activity. If it is desired that the lipase to be selected as a specific 
characteristic, such as a high thermostability, the screening is to be performed under 
conditions (typically temperatures) at which lipases with the desired high thermostability 
can be selected. 

25 Analogously, if the polynucleotide sequence of interest is a control sequence such as a 

promoter sequence the selection or screening step c) is performed under condition in which 
promoter activity can be assessed. Typically, in the library the variant promoter sequences 
are operably linked to a second sequence to be transcribed (eg a polypeptide encoding 
sequence) so that the promoter activity can be assayed with reference to the transcription of 

30 said second sequence. 
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T Jhrarv constnicHnn in bacterial or veast hosts 

The present invention also relates to methods of constructing and screening or selecting 
a library of a polynucleotide variant in a filamentous fiingal cell, comprising: 

(a) transforming a culture of bacterial or yeast cells with a population of DNA vectors, 
5 wherein each vector comprises 

(i) a polynucleotide sequence encoding a filamentous fungal selective marker, a fila- 
mentous fungal replication initiating sequence, a bacterial or yeast selective marker 
and a bacterial or yeast replication initiating sequence, respectively, none of which 
vary substantially within the population of DNA vectors, and 
10 (ii) a polynucleotide sequence of interest, wherein more than one variant of the polynu- 

cleotide sequence is contained within the population of DNA vectors; 

(b) cultivating the bacterial or yeast cells under selective pressure; 

(c) isolating the DNA vectors from the transformants of (b); 

(d) transforming filamentous fungal cells with the DNA vectors of (c); 
15 (e) cultivating the filamentous fungal cells of (d) under selective pressure; 

(f) selecting or screening for one or more filamentous fungal transformants expressing a 
desired characteristic encoded in the modified nucleic acid sequence of (ii) above; and 

(g) isolating the filamentous fungal transformant(s) of interest. 

The advantage of using yeast or bacteria as intermediate hosts is that the transformation 
20 frequency is 100-1000 fold higher than for filamentous fungal cells such as Aspergillus, re- 
sulting in a larger library when first transforming the DNA vector into yeast or bacteria, 
and then transforming the isolated vectors into the filamentous fungal cell. 

In a preferred embodiment, the library of polynucleotide variants is prepared by ran- 
dom mutagenesis or naturally occurring allelic variants of at least one parent polynucleotide 
25 sequence having or encoding a biological function of interest as described above. 
In another preferred embodiment, the bacterial cell is a strain of E. coli. 
In another preferred embodiment, the yeast cell is a strain of Saccharomyces sp., in 
particular, strains of Saccharomyces cerevisiae or Saccharomyces kluyveri or 
Schizosaccharomyces sp. Further examples of suitable yeast cells are strains of 
30 Kluyveromyces, such as K. lactis, Hansenula, e.g. H. polymorpha, or Pichia, e.g. P. 
pastoris. 
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According to the method above, the DNA vector contains a selective marker which per- 
mits easy selection of transformed bacterial or yeast cells. Examples of bacterial selective 
markers include, but are not limited to, markers which confer antibiotic resistance, such as 
ampicillin, kanamycin, erythromycin, chloramphenicol or tetracycline resistance. Further- 
5 more, selection may be accomplished by co-transformation, e.g.y as described in WO 
91/09129, where the selective marker is on a separate vector. 

Suitable markers for yeast host cells are ADE2, fflS3, LEU2, LYS2, MET3, TRPl, and 
URA3. 

In a preferred embodiment, the bacterial or yeast selective marker provides for resistance 
10 to a biocide, wherein the biocide agent is selected from the group consisting of ampicillin, 
kanamycin, tetracyclin, chloramphenicol, neomycin, hygromycin and methotrexate. 

In another preferred embodiment, the bacterial or yeast selective marker is selected 
from the group of genes which encode a product which provides for resistance to biocide or 
viral toxicity, resistance to heavy metal toxicity, or prototrophy to auxotrophs. 
15 In another preferred embodiment, the prototrophy is obtained from an enzyme selected 

from the group of metabolic pathways consisting of nucleotide synthesis, cotactor synthesis, 
amino acid synthesis, acetamide metabolism, proline metabolism, sulfate metabolism, and 
nitrate metabolism. 

For autonomous replication, the vector may further comprise an origin of replication ena- 
20 bling the vector to replicate autonomously in the bacterial or yeast cell in question. Examples 
of bacterial origins of replication are the origins of replication of plasmids pBR322, pUC19, 
pACYC177, and pACYC184 permitting replication in £. coli. 

Examples of origins of replication for use in a yeast host cell are the 2 micron origin of 
replication, ARSl, ARS4, the combination of ARSl and CEN3, and the combination of ARS4 
25 and CEN6. 

The transformation of the bacterial or yeast cell may, for instance, be effected by using 
competent cells, by electroporation, or by conjugation of bacterial cells (see, e.g.^ J. Sam- 
brook, E.F. Fritsch, and T. Maniatus, 1989, Molecular Cloning, A Laboratory Manual^ 2d 
edition, Cold Spring Harbor, New York). The cultivation of these cells under selective pres- 
30 sure may be conducted in accordance with methods known in the art. 
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The filamentous fiingal selective marker, fungal replication intiating sequence and the 
polynucleotide sequence of interest is preferably as herein, eg in the sections entitled: 
"Filamentous fungal selective marker" and "Replication initiating sequences". 

The selection or screening of transformants of interest may be performed as described 
5 above in the section entitled "Selection and screening of transformants". 

Variant polynucleotide sequences 

It will be understood that the present invention will be useful for screening or selecting 
libraries of any type of polynucleotide variants. More specifically, the variant polynucleotide 
10 sequence to be used in the methods of the present invention is derived from a parent 
polynucleotide sequence having or encoding a biological activity of interest. For instance, 
the variant polynucleotide sequences are derived from a gene encoding a polypeptide of in- 
terest by random mutagenesis of the polypeptide encoding sequence. Alternatively, the 
parent polynucleotide sequence is a control sequence as defined above, e.g., a promoter se- 
15 quence. The parent polynucleotide sequence may be a combination of a polypeptide en- 
coding sequence and a control sequence. 

In a preferred embodiment the modification of the parent polynucleotide sequence is 
done by use of a physical or chemical mutagenizing agent, use of a doped oligonucleotide, 
DNA shuffling, or by subjecting the nucleic acid sequence to PCR generated mutagenesis, 
20 or use of any combination thereof. 

Alternatively, the variant polynucleotide sequence may be obtained by subjecting the 
sequence to mutagenesis by misincorporation of nucleotide bases by using an error-prone 
polymerase or a polymerase working under suboptimal conditions in order to promote for- 
mation of errors, i.e., error-prone PCR. Error-prone DNA synthesis may be carried out in 
25 vitro or in vivo, such as by use of various mutator strains. 

Random mutagenesis 

A general approach to prepare variant polynucleotide sequences encoding modified pro- 
teins and enzymes has been based on random mutagenesis, for instance, as disclosed in US 
30 4,894.331 and WO 93/01285. This approach may also be used in connection with the present 
invention. For instance, the random mutagenesis may be performed by use of a suitable 
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physical or chemical mutagenizing agent, by use of a suitable oligonucleotide, or by subjec- 
ting the polynucleotide sequence to PGR generated mutagenesis. Furthermore, the random 
mutagenesis may be performed by use of any combuiation of these mutagenizing agents. The 
mutagenizing agent may, e.g., be one which induces transitions, transversions, inversions, 
5 scrambling, deletions, and/or insertions. 

Examples of a physical or chemical mutagenizing agent suitable for the present purpose 
include ultraviolet (UV) irradiation, hydroxy lamine, N-methyl-N'-nitro-N-nitrosoguanidine 
(MNNG), O-methyl hydroxy lamine, nitrous acid, ethyl methane sulphonate (EMS), sodium 
bisulphite, formic acid, and nucleotide analogues. When such agents are used, the mutagene- 
10 sis is typically performed by incubating the polynucleotide sequence of interest in the presence 
of the mutagenizing agent of choice under suitable conditions for the mutagenesis to take 
place. 

When the mutagenesis is performed by the use of an oligonucleotide, the oligonucleotide 
may be doped or spiked with the three non-parent nucleotides during the synthesis of the ol i- 

15 gonucleotide at the positions which are to be changed. The doping or spiking may be done so 
that codons for unwanted amino acids are avoided. The doped or spiked oligonucleotide can 
be incorporated into the polynucleotide of interest by any published technique, using e.g. 
PGR, LCR or any DNA polymerase and ligase as deemed appropriate. 

Preferably, the doping is carried out using "constant random doping", in which the per- 

20 centage of wild-type and mutation in each position is predefined. Furthermore, the doping 
may be directed toward a preference for the introduction of certain nucleotides, and thereby 
a preference for the introduction of one or more specific amino acid residues. The doping 
may be made, e.g., so as to allow for the introduction of 90% wild type and 10% mutations 
in each position. An additional consideration in the choice of a doping scheme is based on 

25 genetic as well as protein-structural constraints. The doping scheme may be made by using 
the DOPE program (cf., Tomandl, D., et al., 1997. Journal of Computer-Aided Molecular 
Design 11:29-38; Jensen, U, Andersen, KV, Svendsen, A., and Kretzschmar, T., 1998. 
Nucleic Acids Research 26:697-702) which, inter alia, ensures that introduction of stop 
codons is avoided. 

30 When PCR-generated mutagenesis is used, either a chemically treated or non-treated par- 

ent polynucleotide sequence of interest is subjected to PGR under conditions that increase the 
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misincorporation of nucleotides (Deshler, J.O., 1992. Genetic Analysis, Techniques and Ap- 
plications 9:103-106; Leung, et al., 1989. Technique 1:11-15. 

A mutator strain of ^. coli (Fowler, et al., 1974. Molec. Gen. Genet. 133:179-191), S, 
cereviseae or any other microbial organism may be used for the random mutagenesis of the 
5 parent polynucleotide sequence of interest by, e.g., transforming a plasmid containing the 
parent polynucleotide into the mutator strain, growing the mutator strain with the plasmid and 
isolating the mutated plasmid from the mutator strain. The mutated plasmid may be subse- 
quently transformed into the expression organism. 

The polynucleotide sequence to be mutagenised may be conveniently present in a 

10 genomic or cDNA library prepared from an organism harbouring the sequence. Alternatively, 
the sequence may be present on a suitable vector such as a plasmid or a bacteriophage, which 
as such may be incubated with or otherwise exposed to the mutagenising agent. The polynu- 
cleotide sequence to be mutagenised may be in isolated form. It will be understood that the 
polynucleotide sequence to be subjected to random mutagenesis is preferably a cDNA or a 

15 genomic DNA sequence. The mutated DNA sequence may ftirther comprise a DNA se- 
quence encoding functions permitting expression of the mutated DNA sequence. 

DNA shuffling 

Alternative methods for rapid preparation of variants of a polynucleotide sequence in ac- 
20 cordance with the present invention include methods of in vivo or in vitro DNA shuffling, 
wherein DNA shuffling is defined as recombination, either in vivo or in vitro, of nucleotide 
sequence fragment(s) between two or more homologous polynucleotides resulting in output 
polynucleotides (i.e., polynucleotides which have been subjected to a shuffling cycle) con- 
taining a mmiber of exchanged nucleotide fragments when compared to the input polynucleo- 
25 tides (i.e., the polynucleotides subjected to shuffling). Shuffling may be accomplished either 
in vitro or in vivo by recombination within a cell by methods described in the art , examples of 
which are listed below. 

For instance, H. Weber and Weissmann, C. (1983. Nucleic Acids Research 11:5661- 
5669) describe a method for modifying genes by in vivo recombination between two homolo- 
30 gous genes, in which recombinants are identified and isolated using a resistance marker. 
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Pompon, €t al.^ (1989, Gene 83:15-24) describe a method for shuffling gene domains of 
mammalian cytochrome P-450 by in vivo recombination of partially homologous sequences in 
Saccharomyces cerevisiae by transforming Saccharomyces cerevisiae with a linearized plas- 
raid with fiUed-in ends, and a DNA fragment being partially homologous to the ends of said 
5 plasmid. 

In WO 97/07205 a method is described in which polypeptide variants are prepared by 
shuffling different nucleotide sequences of homologous DNA sequences by in vivo recombi- 
nation using plasmid DNA as template. 

US patent no. 5,093,257 (Genencor International, Inc.) discloses a method for producing 
10 hybrid polypeptides by in vivo recombination. 

In a preferred embodiment, the variant polynucleotide sequence is obtained by in vivo 
recombination between two or more homologous nucleic acid sequences of interest, com- 
prising: 

(a) identifying at least one conserved region between the nucleic acid sequences of interest; 
15 (b) generating fragments of each of the polynucleotide sequences of interest, wherein the 
fragments comprise the conserved region(s) of (a); and 
(c) recombining the fragments of (b) by using the conserved region(s) as (a) homologous 
linking point(s). Preferably, the polynucleotide sequences of interest encode a polypep- 
tide or a part thereof or are control sequences as defined above or any combination of 
20 both. 

The term "conserved region" denotes a subsequence, preferably of at least 10 bp, 
shared by two or more sequences in which there is a degree of identity between the subse- 
quences of at least about 50%, more preferably at least about 60%, even more preferably at 
least about 70%, even more preferably at least about 80%, even more preferably at least 

25 about 90%, and most preferably at least about 97%. 

In order for the conserved region to be used as a "linking point" between two se- 
quences, the degree of identity between the sequences within the conserved region(s), is 
sufficiently high to enable hybridisation, e.g., under conditions described supra, between 
the sequences, whereby the conserved region serves as the linking point. 

30 One method for shuffling of homologous DNA sequences in vitro has been described by 

Stemmer (Stemmer, 1994. Proc. Natl. Acad. Sci. USA, 91:10747-10751; Stemmer, 1994. 
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Nature 370:389-391; Crameri. A., et al., 1997. Nature Biotechnology 15:436-438). The 
method relates to shuffling homologous DNA sequences by using in vitro PCR techniques. 
Positive recombinant genes containing shuffled DNA sequences are selected from a DNA li- 
brary based on the improved function of the expressed proteins. 
5 The above method is also described in WO 95/22625 in relation to a method for shuffling 

homologous DNA sequences. An important step in the method is to cleave the homologous 
template double-stranded polynucleotide into random fragments of a desired size followed by 
homologously reassembling the fragments into full-length genes. 

WO 98/41653 discloses a method of DNA shuffling in which a library of recombined 
10 homologous polynucleotides is constructed from a number of different input DNA templates 
and primers by induced template shifts during in vitro DNA synthesis. 

Localized random mutagenesis 

The random mutagenesis may be advantageously localised to a part of the parent polynu- 

15 cleotide sequence in question. The sequence to be modified may be, for example, the coding 
region of a gene, or a part thereof, essential for activity of the gene product, or a control se- 
quence or part thereof as defined above. A preferred example of such control sequence is a 
promoter sequence, or a functional part thereof, z.e., a part which is sufficient for affecting ex- 
pression of the nucleic acid sequence. 

20 Localised random mutagenesis may, e.g., be advantageous when certain regions of the 

polynucleotide sequence of interest have been identified to be of particular importance. For 
instance, when the polynucleotide sequence of interest encodes a polypeptide the important 
region may be one essential for a given property of the polypeptide, the modification of which 
region is expected to result in a variant in which this property has been improved. Such re- 

25 gions may normally be identified when the tertiary structure of the parent polypeptide has 
been elucidated and related to its function. Analogously, when the polynucleotide sequence of 
interest is a control sequence, such as a promoter, the region of interest may be one expected 
to be essential to or involved in promoter activity. 

The localised, or region-specific, random mutagenesis is conveniently performed by use 

30 of PCR generated mutagenesis techniques as described above or any other suitable technique 
known in the art. Alternatively, the nucleotide sequence encoding the part of the polynucleo- 
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tide sequence to be modified may be isolated, e.g., by insertion into a suitable vector, and 
said part may be subsequently subjected to mutagenesis by use of any of the mutagenesis 
methods discussed above. 



S Use of a fung al replication initiating sequence 

In a further aspect the invention relates to the use of a fungal replication intiating se- 
quence as defined herein for constructing and selecting or screening a library of variants of a 
polynucleotide sequence of interest in filamentous fungal cells. Preferably, the library is con- 
structed by the method according to the first or second aspect of the invention. In a preferred 
10 embodiment the fungal replication initiating sequence is selected from the group consisting 
of: 

(a) a replication initiating sequence having at least 50% identity with the nucleic acid se- 
quence of SEQ ID NO:l or SEQ ID NO:2 and is capable of initiating replication; 
(b) a replication initiating sequence which hybridises under low stringency conditions 

15 with (i) the nucleic acid sequence of SEQ ID NO:l or SEQ ID NO:2, or (ii) the re- 

spective complementary strands, wherein the low stringency conditions are defined 
by prehybridization and hybridization at 42°C in 5x SSPE, 0.3% SDS, 200 ^g/ml 
sheared and denatured salmon sperm DNA, and 25% formamide, and wash condi- 
tions are defined at 50°C for 30 minutes in 2x SSC, 0.2% SDS; and 

20 (c) a subsequence of (a) or (b), wherein the subsequence has replication initiating activ- 

ity. 

Preferably, the replication initiating sequence is obtained from a filamentous fungal cell, in 
particular from a strain of Aspergillus^ such as A. nidulans. In the most preferred embodi- 
ment the replication initiating sequence has the nucleic acid sequence set forth in SEQ ID 
25 NO:l or SEQ ID NO: 2, or is a respective functional subsequence thereof. 

Library of variants of a p olynucleotide sequence of interest 

In a further aspect the invention relates to a library of variants of a polynucleotide se- 
quence of interest which library comprises filamentous fungal cells transformed with a 
30 population of DNA vectors, wherein each vector comprises: 
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(i) a gene encoding a fiingal selection marker and a fungal replication initiating sequence 

wherein the marker and the replication initiating sequence do not vary within 
the population; and 

(ii) a polynucleotide sequence of interest wherein the population of DNA vectors contains 
5 more than one variant of the polynucleotide sequence. 

In a preferred embodiment the vector farther comprises a nucleic acid sequence encod- 
ing a bacterial or yeast selective marker and a bacterial or yeast replication initiating se- 
quence. Preferably, the library is prepared by the method according to the first or second 
aspect of the invention. Preferably, the elements of the library, such as the DNA vector and 
10 its mentioned components are as described herein. 

The present invention is further described by the following examples which should not be 
construed as limiting the scope of the invention. 
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Examples 

Materials 

Chemicals used as buffers and substrates were commercial products of at least reagent 
grade. 



20 Plasmids 



25 



30 



pMT1505: 
pHelpl: 



pToC68: 

pAHL: 

pS02: 

pENI1127 

pENI1245 

pENI1246 



constructed as described below in Example 1 

contains the pyrG gene from A, oryzae as a selective marker and the AMAl 

sequences which enable autonomous replication in A. nidulans as described 

by Gems, D., et aL (1991. Gene 98: 61-67) 

as described in EP 0 531 372 (Novo Nordisk A/S) 

as described in WO 92/05249, containing a lipase encoding sequence 

as described in WO 96/29391, containing the Aspergillus oryzae pyrG gene 

constructed as described below in Example 1 

constructed as described below in Example 1 

constructed as described below in Example 1 
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pENI1298: constructed as described below in Example 1 
pENI1299; constructed as described below in Example 1 

Strains 

5 JaL250: a derivative of Aspergillus oryzae A 1560 in which the pyrG gene has been 
inactivated, as described in WO 98/01470 
DH5a: an E. coli host cell purchased from GIBCO BRL (Life Technologies, Inc., 

Rockville MD) 

10 Example 1: Construction of pENI1298 and pENI1299 

pMT1466 was constructed by inserting an Sphl/Narl fragment from pHelpl into 
pToC68. pMT1489 was constructed by digesting pMT1466 with SphI and StuI, then reli- 
gating. pMTlSOO was constructed by digesting pMT1489 with Aatll and Narl and ligating 

15 a linker. pMT1594 was constructed by digesting pMT1500 with Nhel and religating. 
pMT1505 was constructed by inserting a 2.7 kb Xbal fragment containing the amdS en- 
coding gene from A. nidulans genomic DNA (Corrick, CM., et al. 1987, Gene 53:63-71) 
into pMT1504 which had been cut with Nhel. pENI1127 was constructed from pMT1505 
which had been digested with Sail in order to remove one of the AMAl repeats and the 

20 central spacer region. 

pENI1127 and pMT1505 were each cut with Hindin, resulting in fragments of 2408 
bp and 5253 bp, respectively, which were purified from a gel of 1 % agarose and cloned 
into the Hindlll site in the vector pAHL which contains a lipase encoding sequence. The 
resulting plasmids were called pENI1245 and pENI1246, respectively. 

25 A 3527 bp Stul/Bbul fragment containing the pyrG gene was excised from pS02 and 

inserted into a Stul/Bbul site in both pENI1245 and pENI1246. The resulting plasmids 
were called pENI1298 and pENI1299, respectively. The restriction map for pENI1298 is 
shown in Figure 1, and for pENI1299, in Figure 2. 

30 Example 2: Expression Levels of Lipase in Independently Grown Transformants 
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JaL250 was transformed with pENI1298 and pENI1299 using standard procedures, cf., 
as described in WO 98/01470. The cells were then cultured on Cove plates at 37**C. 

Transformants appeared after three days incubation at a transformation frequency of 10* 
- 10* /^ig DNA, an increase of 100- to 10,000-fold over the transformation frequency in the 
5 absence of an AMAl sequence. 

Thirty independent transformants from the pENI1298 and pENI1299 transformations 
were isolated on Cove plates and at the same time inoculated into a 96-weIl microtiter dish 
containing 200 jal minimal media of l*vogel, 2% maltose (e.g.. Methods in Enzymolog y. 
Vol. 17 p. 84) in each well. 

10 After three days of incubation at 34°C, media from the cultures in the microtiter dish 

were assayed for lipase activity. A 10 \il aliquot of media from each well was added to a 
microtiter well containing 200 |al of a lipase substrate of 0.018% p-nitrophenylbutyrate, 
0.1% Triton X-100, 10 mM CaCIj, 50 mM Tris pH 7.5. Activity was assayed spectro- 
photometrically at 15-second intervals over a five minute period, using a kinetic microplate 

15 reader (Molecular Device Corp., Sunnyvale CA), using a standard enzymology protocol 
(e.g.. Enzyme Kinetics, Paul C.Engel, ed., 1981, Chapman and Hall Ltd.) Briefly, product 
formation is measured during the initial rate of substrate turnover and is defined as the 
slope of the curve calculated from the absorbance at 405 nm every 15 seconds for 5 min- 
utes. The arbitrary lipase activity units were normalized against the trans formant showing 

20 the highest lipase activity. For each group of thirty transformants an average value and the 
standard deviation were calculated. 

At the same time, the transformants which had been cultured on Cove plates for three 
days at 37°C were reisolated onto a second Cove plate and reinoculated into a microtiter 
well as before. The procedure of assay, reisolation and reinoculation was repeated once 

25 more after an additional three days of culture. 

The results, summarized below in Table 1, show the amount of lipase produced, rela- 
tive to the amount produced by the pENI1298 transformants six days after transformation. 
As indicated by the low values for the standard deviation, there is little variation in the li- 
pase expression level among the 30 independently grown transformants. Usually, when do- 

30 ing transformation of filamentous ftingi, a much larger relative standard deviation is seen 
(70%-100%), due to both random integration into the genome of the vector and differences 
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in numbers of vectors getting integrated. There can be 1(X)-1000 fold difference in expres- 
sion levels between the worst and the best producing fungal transformant. 



Days 


Plasmid 


% Expression 


Std. Dev. 


6 


pENI1298 


100 


17 


6 


pENI1299 


62 


20 


9 


pENI1298 


100 


17 


9 


pENI1299 


55 


37 


12 


pENI1298 


82 


27 


12 


pENI1299 


47 


57 



5 

Table 1. The average expression levels of lipase from 30 independently grown transfor- 
mants relative to pENI1298 six days after transformation and the standard deviation for 
each. 

10 Example 3: Testing for rearrangment of the plasmid 

Transformants of Jal250 containing either pENI1298 or pENI1299 were grown in YPD 
medium overnight. 

DNA was prepared from each of the transformants using a QIAprep® Miniprep Kit 
15 (QIAGEN, Venlo, The Netherlands) in which the procedure provided by the manufacturer 
had been modified. Briefly, each strain was grown in 5 ml YPD for three days. The my- 
celia were collected by filtration and washed with 200 ml of water, then transferred to a 2 
ml microftige tube and lyophilized by centrifugation under vacuum for three hours at 60**C. 
The dried mycelia was then ground and resuspended in one ml of lysis buffer (100 mM 
20 EDTA, 10 mM Tris pH8.0, 1 % tritonX-100, 500 mM guanidine-HCl, 200 mM NaCl), 
followed by thorough mixing. Twenty |ig RNAse was added to each tube which was then 
incubated at 37°C for 10 min. One hundred ^g proteinase K was added, and the reaction 
was incubated for 30 min.at 50°C. Each tube was then centrifuged for 15 min at top speed 
in a standard bench top microftige. The supernatant was applied onto a QIAprep® spin col- 
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umn, then centrifuged and filtrate discarded. The column was next washed in 0.5 ml PB 
provided in the kit, and centrifuged again for one minute. After the filtrate was discarded, 
the column was washed in 0.75 ml PE provided in the kit, then centrifuged once more for 
one minute. The column was allowed to air dry, and the DNA was eluted by addition of 
5 100 jal TE buffer followed by a final one min spin. 

Transformants were obtained when the DNA was transformed into E, coli DH5a, con- 
firming that the plasmids remained episomal in A, oryzae. Plasmid DNA was purified from 
the bacterial cells using QIAprep® Miniprep Kit (QIAGEN), according to the manufac- 
turer's instructions. 

10 The purified plasmid DNA was then digested with Seal, and the restriction pattern was 

analyzed by standard agarose gel fractionation techniques. 

When compared to the restriction patttern of the original plasmid, the results showed 
no rearrangement in five pENI1299 JaL250 transformants and only one in eight pENI1298 
JaL250 transformants, indicating the rarity of plasmid rearrangement events. 

15 

Example 4: Screening a Library 

In order to identify variant polypeptides with improved functional characteristics ex- 
pressed at a level comparable to the parental polypeptide a library of variant polynucleotide 

20 sequences was constructed and screened in Aspergillus oryzae. Polymerase chain reactions, 
using pENI1298 as the template and 2 pmol/|al of each primer: oligo 19670 as one primer, 
and as a second primer, oligo 23701, 23702 or 23703. as described below, were run under 
the following conditions: 94**C, 5 min.; 30 cycles of (94^C, 30 sec.; 50°C, 30 sec; 72°C, 1 
min.), and 72*'C, 5 min. A commercial Taq polymerase, AmpliTaq, was used as recom- 

25 mended by the supplier (Perkin-EImer Corp., Branchburg NJ, USA). The sequence of 
19670 is shown below: 

19670: CCCCATCCTTTAACTATAGCG 

30 The sequence of the doped oligos are shown below: 
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23701 : CGTGGACGGCCTTGGCTAGCCCTCCTCSACSACSACSAGTCTCGCAGGAT- 
CTG 

23702: CTGCGTGGACGGCCTTGGCTAGCCCTCCTCSACSACSACSACSAGTCTCG- 
CAGGATCTGTTTAACCAG 
5 23703 : TCTCTGCGTGGACGGCCTTGGCTAGCCCTCCTCS ACS ACSACS ACS ACS A- 
GTCTCGCAGGATCTGTTTAACCAG 

The resulting products were purified using microfuge spin columns S300 (Pharmacia- 
LKB, Uppsala SE), The purified products, including pENI1298 DNA, were then subjected 
10 to a second round of PCR amplification under the following conditions: 94°C, 5min.; 30 
cycles of (94°C, 1 min.; 50T. 10 min.; 72°C, 2 min.; and 72'*C, 7 min., using the follow- 
ing primer: 

19671: CTCCCTTCTCTGAACAATAAACCC 

15 

The products were next subjected to Dpnl digest to remove template DNA, and then 
incubated at 94°C for 30 min. to inactivate the Dpnl enzyme. 

JaL250 was transformed using 2 ng of pENI1298, which had been digested with Ball 
and SgrAl to remove most of the lipase encoding sequence, and 5 )ig of one of the PCR 

20 fi-agments originating from the 23701, 23702 or 23703 doped-oligo reactions. The vector 
and the PCR fragment were allowed to recombine in vivo as described in WO 97/07205, 
JaL250 cells were also transformed with the digested pENI1298 or each of the PCR prod- 
ucts alone. One transformant was obtained when the cells were transformed with the vector 
alone, and no transformants resulted from transformation with the PCR fragment alone. 

25 Fifteen, 12 and 26 transformants were inoculated from the 23701, 23702 and 23703 

libraries, respectively, into microtiter plates containing 200 jal of l*vogel, 2% glucose me- 
dia and incubated for 72 hours. Jal250 transformants of pENI1298 were also inoculated as 
a control. The cultures were then streaked onto Cove plates. Lipase activity in the mi- 
crotiter cultures was assayed as described above in Example 2 and in a detergent containing 

30 assay in which 10 of the microtiter culture was added to 200 ^1 of a lipase substrate pre- 
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pared using a commercial laundry detergent in microtiter wells. Activity was measured 
spectrophotometrically at 405 nm and calculated as described above in Example 2. 

Each of the transformants which showed activity in the detergent containing assay was 
reisolated on a Cove plate. From each of the Cove plates that had been incubated for 72 
' 5 hours at 37**C, two colonies were inoculated into a microtiter dish as described above, along 
with ten pENI1298 transformants, then cultured for 72 hours at 34°C. All the clones ex- 
hibited activity in the detergent assay, whereas the pENI1298 transformants did not. Fur- 
thermore, the transformants grown as independent duplicate cultures showed relatively 
similar levels of activity. The results are summarized below in Table 2. 

10 







Pnp-Butyrate Activity 


Detergent Activity 


Library 


Clone 


Average 


Std Dev 




Average 


Std Dev 




pENI1298 




96 


16 




3 


80 








Colony 1 


Colony 2 


Average 


Colony 1 


Colony 2 


Average 


23701 


1 


97 


114 


105.5 


45 


53 


49 




2 


60 


87 


73.5 


12 


19 


15.5 




3 


53 


68 


60,5 


9 


14 


11.5 


















23702 


1 


58 


55 


56.5 


16 


15 


15,5 




2 


62 


71 


66.5 


16 


21 


18,5 




3 


47 


65 


56 


17 


17 


17 




4 


49 


45 


47 


16 


17 


16,5 




5 


35 


42 


38.5 


11 


13 


12 




6 


34 


40 


37 


17 


14 


15,5 




7 


80 


79 


79,5 


30 


28 


29 




8 


49 


56 


52.5 


17 


23 


20 


















23703 


1 


111 


122 


116,5 


28 


28 


28 




2 


122 


110 


116 


32 


28 


30 




3 


106 


98 


102 


32 


29 


30.5 
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4 


55 


32 


43,5 


11 


6 


8.5 



Table 2. Lipase activity on pnp-butyrate and a commercial laundry detergent. 



Because clone 23701.1 showed excellent expression as well as performance in both the 
5 pnp-butyrate and detergent assays, DNA was isolated in order to determine the deduced N- 
terminal sequence of the polypeptide, which was determined to be SPPRRPP. The natu- 
rally occurring N-terminal sequence is SPIRRE, which encodes a propeptide that is cleaved 
off by a kex2-like protease. DNA from some of the other transformants was sequenced, 
giving rise to the following deduced N-terminal sequences: 23701 .2 (SPPRRRR), 23702. 1 
10 (SPPRPRRR). 23702.7 (SPPRPRRP), 23703.1 (SPPRRRRRP), and 23703.4 
(SPPRRPRRR). Thus, a variant polypeptide with improved functional characteristics which 
can be expressed at a level comparable to the parental polypeptide was identified in a pro- 
duction host. 

The invention described and claimed herein is not to be limited in scope by the specific 
15 embodiments herein disclosed, since these embodiments are intended as illustrations of sev- 
eral aspects of the invention. Any equivalent embodiments are intended to be within the 
scope of this invention. Indeed, various modifications of the invention in addition to those 
shown and described herein will become apparent to those skilled in the art from the forego- 
ing description. Such modifications are also intended to fall within the scope of the appended 
20 claims. 

All publications mentioned herein are incorporated herein by reference for the purpose of 
describing and disclosing the particular information for which the publication was cited. The 
publications discussed above are provided solely for their disclosure prior to the filing date of 
the present application. Nothing herein is to be construed as an admission that the inventors 

25 are not entitled to antedate such disclosure by virtue of prior invention. 

It is to be understood that this invention is not limited to the particular methods and com- 
positions described as such may, of course, vary. It is also to be understood that the terminol- 
ogy used herein is for the purpose of describing particular embodiments only, and is not in- 
tended to be limiting since the scope of the present invention will be limited only by the ap- 

30 pended claims. 
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Unless defined otherwise all technical and scientific terms used herein have the same 
meaning as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although any materials or methods similar or equivalent to those described herein 
can be used in the practice or testing of the present invention, the preferred methods and mate- 
5 rials are described. 



10 
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Claims 

1. A method of constructing and selecting or screening a library of variants of a polynu- 
cleotide sequence of interest in filamentous fungal cells wherein the method comprises: 

5 (a) transforming the fungal cells with a population of DNA vectors, wherein each 

vector comprises: 

(i) a gene encoding a ftingal selection marker and a fungal replication initiating 
sequence wherein the marker and the replication initiating sequence do not 
vary within the population; and 
10 (ii) a polynucleotide sequence of interest wherein the population of DNA vectors 

contains more than one variant of the polynucleotide sequence; 

(b) cultivating the cells under selection pressure; 

(c) selecting or screening for one or more transformants expressing a desired charac- 
teristic encoded in the modified polynucleotide sequence of (ii) above; and 

15 (d) isolating the transformant(s) of interest. 

2. The method according to claim 1, wherein the library of polynucleotide variants is pre- 
pared by random mutagenesis or naturally occurring allelic variations of at least one par- 
ent polynucleotide sequence having or encoding a biological function of interest. 

20 

3. The method of claim 3, wherein the polyncleotide sequence encodes a polypeptide or is a 
control sequence; or wherein the polynucleotide sequence encodes a polypeptide or part 
thereof and further comprises a control sequence involved in the expression of the 
polypeptide or a part of such control sequence, 

25 

4. The method according to claim 3, wherein the polypeptide is a hormone, an enzyme, a 
receptor or a portion thereof, an antibody or a portion thereof, or a reporter, or a regula- 
tory protein. 

30 5. The method of claim 4, wherein the enzyme is an oxidoreductase, a transferase, a hy- 
drolase, a lyase, an isomerase, or a ligase. 
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6, The method according to any of claims 4 or 5, wherein the enzyme is an aminopepti- 
dase. amylase, carbohydrase, carboxypeptidase, catalase, cellulase, chitinase, cutinase, 
cy clodextrin gly cosy Itransferase, deoxy ribonuclease , esterase , alpha-galactosidase , 
5 beta-galactosidase, glucoamylase, alpha-glucosidase, beta-glucosidase, invertase, lac- 

case, lipase, mannosidase, mutanase, oxidase, a pectinolytic enzyme, peroxidase, phy- 
tase, polyphenoloxidase, a proteolytic enzyme, ribonuclease, transglutaminase, or xyla- 
nase. 



10 7. The method according to claim 1 or 2, wherein the control sequence is an enhancer se- 
quence, a leader sequence, a polyadenylation sequence, a propeptide sequence, a pro- 
moter, a replication initiation sequence, a signal sequence, a transcriptional terminator 
or a translational terminator. 



15 8. The method of claim 7, wherein the promoter is derived from the gene encoding As- 
pergillus oryzae TAKA amylase, NA2-tpi and Aspergillus niger or Aspergillus awamori 
glucoamylase. 

9. The method according to any of claims 1-8, wherein the selective marker is selected 
20 from the group of genes which encode a product which provides for resistance to bio- 

cide or viral toxicity, resistance to heavy metal toxicity, or prototrophy to auxotrophs. 



10. The method of claim 9, wherein the prototrophy is obtained from an enzyme selected 
from the group of metabolic pathways consisting of nucleotide synthesis, cofactor syn- 

25 thesis, amino acid synthesis, acetamide metabolism, proline metabolism, sulfate me- 

tabolism, and nitrate metabolism. 

11. The method of claim 9, wherein the selective marker is a gene selected from the group 
consisting of argB (ornithine carbamoy Itransferase), amdS (acetamidase), bar 

30 (phosphinothricin acety Itransferase), hemA (5-aminolevulinate synthase), hemB 

(porphobilinogen synthase), hygB (hygromycin phosphotransferase), niaD (nitrate re- 
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ductase), pm (proline permease), pyrG (orotidine-5*-phosphate decarboxylase), pyroA, 
riboB, sC (sulfate adenyltransferase), and trpC (anthranilate synthase). 

12. The method of any of claims 1 to 11, wherein the replication initiating sequence is a 
5 nucleic acid sequence selected from the group consisting of: 

(a) a replication initiating sequence having at least 50% identity with the nucleic acid 
sequence of SEQ ID NO:l or SEQ ID NO:2 and is capable of initiating replication; 

(b) a replication initiating sequence which hybridises under low stringency conditions 
with (i) the nucleic acid sequence of SEQ ID NO:l or SEQ ID NO:2, or (ii) the re- 

10 spective complementary strands, wherein the low stringency conditions are defined 

by prehybridization and hybridization at 42°C in 5x SSPE, 0.3% SDS, 200 jig/ml 
sheared and denatured salmon sperm DNA, and 25% formamide, and wash condi- 
tions are defined at 50°C for 30 minutes in 2x SSC, 0.2% SDS; and 

(c) a subsequence of (a) or (b), wherein the subsequence has replication initiating activ- 
15 ity. 

13. The method of claim 12, wherein the nucleic acid sequence has at least 50% identity, 
more preferably about 60%, even more preferably about 70%, even more preferably 
about 80%, even more preferably about 90%, and most preferably about 97% identity 

20 with the nucleic acid sequence of SEQ ID NO: 1 or SEQ ID NO:2. 

14. The method of claim 12, wherein the replication initiating sequence is obtained from a 
filamentous fungal cell. 

25 15. The method of claim 14, wherein the filamentous fungal cell is a strain of Aspergillus, 

16. The method of claim 15, wherein the strain of Aspergillus is obtained from a strain of 
A, nidulans. 
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17. The method of any of claims 12 to 16, wherein the replication initiating sequence has 
the nucleic acid sequence set forth in SEQ ID NO:l or SEQ ID NO:2, or is a respective 
functional subsequence thereof. 

18. The method of claim 2, wherein the modification of the parent polynucleotide sequence 
5 is performed by mutagenesis, preferably random mutagenesis, by use of a physical or 

chemical mutagenizing agent, use of a doped oligonucleotide, DNA shuffling, or by 
subjecting the nucleic acid sequence to PGR generated mutagenesis, or use of any com- 
bination thereof. 

10 19. The method of claim 18, wherein the variant polynucleotide sequence is obtained by in 
vivo recombination between two or more homologous nucleic acid sequences encoding 
a polypeptide or a regulatory sequence, or any combination of both, comprising: 
(a) identifying at least one conserved region between the nucleic acid sequences of in- 
terest; 

15 (b) generating fragments of each of the polynucleotide sequences of interest, wherein 

said fragments comprise the conserved region(s) of (a); and 
(c) recombining the fragments of (b) by using the conserved region(s) as (a) homolo- 
gous linking point(s), 

20 20. The method according to any of claims 1-19, wherein the filamentous fungal cell trans- 
formed with the population of DNA vectors is a cell of a strain of Acremonium, Asper- 
gillus, Coprinus, Fusarium, Humicola, Mucor, Myceliopthom, Neurospora, Penicil- 
lium, Thielavia, Tolypocladium or Trichoderma, 

25 21. The method according to claim 20, wherein the cell is an Aspergillus oryzae, Aspergil- 
lus niger, Aspergillus nidulans, Coprinus cinereus, Fusarium oxysporum, or Trichoderma 
reesei cell. 

30 22. A method of constructing and screening or selecting a library of a polynucleotide vari- 
ant in a filamentous fungal cell, wherein the method comprises: 
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(a) transforming a culture of bacterial or yeast cells with a population of the DNA 
vectors as described in any of claims 1-21, wherein the vector further comprises a 
nucleic acid sequence encoding a bacterial or yeast selective marker and a bacterial 
or yeast replication initiating sequence; 
5 (b) cultivating the bacterial or yeast cells under selection pressure; 

(c) isolating the DNA constructs from the transformants of (b); 

(d) transforming filamentous fungal cells with the DNA constructs of (c); 

(e) cultivating the filamentous fungal cells of (d); 

(f) selecting or screening for one or more filamentous fungal transformants expressing 
10 a desired characteristic encoded in the modified nucleic acid sequence of claim 1 

(ii) above; and 

(g) isolating the filamentous fungal transformant(s) of interest. 



23. The method of claim 22, wherein the bacterial or yeast selective marker is selected 
15 from the group of genes which encode a product which provides for resistance to bio- 

cide or viral toxicity, resistance to heavy metal toxicity, or prototrophy to auxotrophs. 

23. Use of a fungal replication initiating sequence in the construction of a library of vari- 
ants of a polynucleotide sequence of interest. 

20 

24. The use according to claim 23, wherein the fungal replication initiating sequence is a 
nucleic acid sequence selected from the group consisting of: 

(a) a replication initiating sequence having at least 50% identity with the nucleic acid 
sequence of SEQ ID NO:l or SEQ ID NO:2 and is capable of initiating replication; 

25 (b) a replication initiating sequence which hybridises under low stringency conditions 

with (i) the nucleic acid sequence of SEQ ID NO:l or SEQ ID NO:2, or (ii) the re- 
spective complementary strands, wherein the low stringency conditions are defined 
by prehybridization and hybridization at 42°C in 5x SSPE, 0.3% SDS, 200 ^g/ml 
sheared and denatured salmon sperm DNA, and 25% formamide. and wash condi- 

30 tions are defined at 50°C for 30 minutes in 2x SSC, 0.2% SDS; and 
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(c) a subsequence of (a) or (b), wherein the subsequence has replication initiating activ- 
ity. 

25. The use according to claim 24, wherein the replication initiating sequence is obtained 
5 from a filamentous fungal cell, in particular from a strain of Aspergillus, such as A, 

nidulans. 

26. The use according to claim 24 or 25, wherein the replication initiating sequence has the 
nucleic acid sequence set forth in SEQ ID NO:l or SEQ ID N0:2, or is a respective 

10 functional subsequence thereof 

27. A library of variants of a polynucleotide sequence of interest which library comprises 
filamentous fungal cells tranformed with a population of DNA vectors, wherein each 
vector comprises: 

15 (i) a gene encoding a fungal selection marker and a fungal replication initiating sequence 
wherein the marker and the replication initiating sequence do not vary within 
the population; and 

(ii) a polynucleotide sequence of interest wherein the population of DNA vectors contains 
more than one variant of the polynucleotide sequence. 

20 

28. The library according to claim 27, wherein the vector further comprises a nucleic acid 
sequence encoding a bacterial or yeast selective marker and a bacterial or yeast replication 
initiating sequence. 
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SEQ ID NO 1 

aagcttattttttgtatactgttttgtgatagcacgaagtttttccacggtatcttgtaaaaatatatatttgtggcggg 
cttacctacatcaaattaataagagactaattataaactaaacacacaagcaagctactttagggtaaaagtt^^ 
5 gcttttgacgtataaacgttgcttgtatttattattacaattaaaggtggatagaaaacctagagactagttagaaacta 

atctcaggtttgcgttaaactaaatcagagcccgagaggttaacagaacctagaaggggactagatatccgggtagggaa 
acaaaaaaaaaaaacaagacagccacatattagggagactagttagaagctagttccaggactaggaaaataaaagacaa 
tgataccacagtctagttgacaactagatagattctagattgaggccaaagtctctgagatccaggttagttgcaactaa 
tactagttagtatctagtclcctataactctgaagctagaataacttactactattatcctcaccactgtt^ 

10 aaacggagtgattgcaaggtgttcagagactagttattgactagtcagtgactagcaataactaacaaggtattaaccta 
ccatgtctgccatcaccctgcacttcctcgggctcagcagccttttcctcctcatmcatgctcattttccttgt^^ 
gactgtgactagtcaaagactagtccagaaccacaaaggagaaatgtcttaccactttcttcattgcttgtctcttttgc 
attatccatgtctgcaactagttagagtctagttagtgactagtccgacgaggacttgcttgtctccggattgttggagg 
aactctccagggcctcaagatccacaacagagccttctagaagactggtcaataactagttggtctttgtctgagtctga 

15 cttacgaggttgcataclcgctccctttgcctcgtcaatcgatgagaaaaagcgccaaaactcgcaatatggctttgaac 
cacacggtgctgagactagttagaatctagtcccaaactagcttggatagcttacctttgccctttgcgttgcgacaggt 
cttgcagggtatggttcctttctcaccagctgatttagctgccttgctaccctcacggcggatctgccataaagagtggc 
tagaggttataaattagcactgatcctaggtacggggctgaatgtaacttgcctttcctttctcatcgcgcggcaagaca 
ggcttgctcaaattcctacxagtcacaggggtatgcacggcgtacggaccacttgaactagtcacagattagttagcaac 

20 tagtctgcattgaatggctgtacttacgggccctcgccattgtcctgatcatttccagcttcaccctcgttgctgcaaag 
tagttagtgactagtcaaggactagttgaaatgggagaagaaactcacgaattctcgactcccttagtattgtggtcctt 
ggacttggtgctgctatatattagctaatacactagttagactcacagaaacttacgcagctcgcttgcgcttcttg 
ggagtcggggttgggagaacagtgccttcaaacaagccttcatacx:atgctacttgactagtcagggactagtcaccaag 
taatctagataggacttgcctttggcctccatcagttccttcatagtgggaggaccattgtgcaatgtaaactccatgcc 

25 gtgggagttcttgtccttcaagtgcttgaccaatatgtttctgttggcagagggaacctgtcaactagttaataactagt 
cagaaactatgatagcagtagactcactgtacgcttgaggcatcccttcactcggcagtagacttcatatggatggatat 
caggcacgccattgtcgtcctgtggactagtcagtaactaggcttaaagctagtcgggtcggcttactatcttgaaatc^ 
ggcagcgtaagctcccxgtccttaactgcctcgagatagtgacagtactctggggactttcggagatcgttatcgttatc 
gcgaatgctcggcatactaactgttgactagtcttggactagtcccgagcaaaaaggattggaggaggaggaggaaggtg 

30 agagtgagacaaagagcgaaauagagcttcaaaggctatctcmgcagtatgaaggttaagtatctagttcttgacta 
gatttaaagagatttcgactagttatgtacctggagtttggatataggaatgtgttgtggtaacgaaatgtaagggggag 
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gaaagaaaaagtcgtcaagaggtaactctaagtcggccattcctttttgggaggcgctaaccataaacggcatggtcgac 
ttagagttagctcagggaatttagggagttatctgcgaccaccgaggaacggcggaatgccaaagaatcccgatggagct 
ctagctggcggttgacaaccccaccttttggcgtttctgcggcgttgcaggcgggactggatacttcgtagaaccagaaa 
ggcaaggcagaacgcgctcagcaagagtgttggaagtgatagcatgatgtgccttgttaactaggtaccaatctgcagta 
5 tgcttgatgttatccaaagtgtgagagaggaaggtccaaacatacacgattgggagagggcctaggtataagagtttttg 
agtagaacgcatgtgagcccagccatctcgaggagattaaacacgggccggcatttgatggctatgttagtaccccaatg 
gaaacggtgagagtccagtggtcgcagataactccctaaattccctgagctaactctaagtcgaccatgccgtttatggt 
tagcgcctcccaaaaaggaatggccgacttagagttacctcttgacgactttttctttcctcccccttacatttcgttac 
cacaacacattcctatatccaaactccaggtacataactagtcgaaatctctttaaatctagtcaagaactagatactt^ 

10 accttcatactgcttagagatagcctttgaagctcttatttcgctctttgtctcactctcaccttcctcctcctcc^^ 

atcctttttgctcgggactagtccaagactagtcaacagttagtatgccgagcattcgcgataacgataacgatctccga 
aagtccccagagtactgtcactatctcgaggcagttaaggacggggagcttacgctgccggatttcaagatagtaagccg 
acccgactagctttaagcctagttactgactagtccacaggacgacaatggcgtgcctgatatccatccatatgaa^ 
actgccgagtgaagggatgcctcaagcgtacagtgagtctactgctatcatagtttctgactagttattaactagttgac 

15 aggttccctctgccaacagaaacatattggtcaagcacttgaaggacaagaactcccacggcatggagtttacattgcac 
aatggtcctcccactatgaaggaactgatggaggccaaaggcaagtcctatctagattacttggtgactagtccctgact 
agtcaagtagcatggtatgaaggcttgtttgaaggcactgttctcccaaccccgactcctaccaagaagcgcaagcgagc 
tgcgtaagtttctgtgagtctaactagtgtattagctaatatatagcagcaccaagtccaaggaccacaatactaaggga 
gtcgagaattcgtgagtttcttctcccatttcaactagtccttgactagtcactaactactttgcagcaacgagggtgaa 

20 gctggaaatgatcaggacaatggcgagggcccgtaagtacagccattcaatgcagactagttgctaactaatctgtga 
agttcaagtggtccgtacgccgtgcatacccctgtgactggtaggaatttgagcaagcctgtcngccgcgcgatgagaa 
aggaaaggcaagttacattcagccccgtacctaggatcagtgctaatttataacctctagccactctttatggcagatcc 
gccgtgagggtagcaaggcagctaaatcagctggtgagaaaggaaccataccctgcaagacctgtcgcaacgcaaagggc 
aaaggtaagctatccaagctagtttgggactagattctaactagtctcagcaccgtgtggttcaaagccatattgcgagt 

25 tttggcgctttttctcatcgattgacgaggcaaagggagcgagtatgcaacctcgtaagtcagactcagacaaagaccaa 
ctagttattgaccagtcttctagaaggctctgttgtggatcttgaggccctggagagttcctccaacaatccggagacaa 
gcaagtcctcgtcggactagtcactaactagactctaactagttgcagacatggataatgcaaaagagacaagcaatgaa 
gaaagtggtaagacatttctcctttgtggttctggactagtclttgactagtcacagtcttaaacaaggaaaatgagcat 
gaaaatgaggaggaaaaggctgctgagcccgaggaagtgcagggtgatggcagacatggtaggttaataccttgttagtt 

30 attgctagtcactgactagtcaataactagtctctgaacaccttgcaatcactccgtttgcgcagctgaacagtggtgag 
gataatagtagtaagttattctagcttcagagttataggagactagatactaactagtattagttgcaactaacctggat 
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ctcagagactttggcctcaatctagaatctatctagttgtcaactagactgtggtatcattgtcttttattttcctagtc 
ctggaactagcttctaactagtctccxtaatatgtggctgtcttgtmtttttmgmccctacccggatatctagt 
ccccttctaggttctgttaacctctcgggctctgatttagtttaacgcaaacctgagattagtttctaactagtctctag 
gttttctatccacctttaattgtaataataaatacaagcaacgtttatacgtcaaaagcatttataaacttttaccctaa 
- 5 agtagcttgcttgtgtgtttagtttataattagtctcttattaatttgatgtaggtaagcccgccacaaatatatatttt 
tacaagataccgtggaaaaacttcgtgctatcacaaaacagtatacaaaaaataagctt 

SEQ ro NO 2 

10 aagcttattttttgtatactgttttgtgatagcacgaagtttttccacggtatcttgtaaaaatatatatttgtggcggg 

cttacctacatcaaattaataagagactaattataaactaaacacacaagcaagctactttagggtaaaagtttataaat 
gcttttgacgtataaacgttgcttgtatttattattacaattaaaggtggatagaaaacctagagactagttagaaacta 
atctcaggtttgcgttaaactaaatcagagcccgagaggttaacagaacctagaaggggactagatatccgggtagggaa 
acaaaaaaaaaaaacaagacagccacatattagggagactagttagaagctagttccaggactaggaaaataaaagacaa 

15 tgataccacagtctagttgacaactagatagattctagattgaggccaaagtctctgagatccaggttagttgcaactaa 
tactagttagtatctagtctcctataactctgaagctagaataacttactactattatcctcaccactgttcagctgcgc 
aaacggagtgattgcaaggtgttcagagactagttattgactagtcagtgactagcaataactaacaaggtattaaccta 
ccatgtctgccatcaccctgcacttcctcgggctcagcagcctmcctcctcamtcatgctcattttccttgttt^^ 
gactgtgactagtcaaagactagtccagaaccacaaaggagaaatgtcttaccactttcttcattgcttgtctcttttgc 

20 attatccatgtctgcaactagttagagtctagttagtgactagtccgacgaggacttgcttgtclccggattgttggagg 
aactctccagggcctcaagatccacaacagagccttctagaagactggtcaataactagttggtctttgtctgagtct^^ 
cttacgaggttgcatactcgctccctttgcctcgtcaatcgatgagaaaaagcgccaaaactcgcaatatggctttgaac 
cacacggtgctgagactagttagaatctagtcccaaactagcttggatagcttacctttgccctttgcgttgcgacaggt 
cttgcagggtatggttcctttctcaccagctgatttagctgccttgctaccctcacggcggatctgccataaagagtggc 

25 tagaggttataaattagcactgatcctaggtacggggctgaatgtaacttgcctttcctttctcatcgcgcggcaagaca 

ggcttgctcaaattcctaccagtcacaggggmtgcacggcgtacggaccacttgaactagtcacagattagttagcaac 
^ tagtctgcattgaatggctgtacttacgggccctcgccattgtcctgatcattt(xagcttcaccclcgttgc^^ 

tagttagtgactagtcaaggactagttgaaatgggagaagaaactcacgaattctcgactcccttagtattgtggtcctt 
ggacttggtgctgctatatattagctaatacactagttagactcacagaaacttacgcagctcgcttgcgcttcttggta 

30 ggagtcggggttgggagaacagtgccttcaaacaagccttcataccatgctacttgactagtcagggactagtcaccaag 
taatctagataggacttgcctttggcctccatcagttccttcatagtgggaggaccattgtgcaatglaaactccatgcc 
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gtgggagttcttgtccttcaagtgcttgaccaatatgtttctgttggcagagggaacctgtcaactagttaataactagt 
cagaaactatgatagcagtagactcactgtacgcttgaggcatcccttcactcggcagtagacttcatatggatggatat 
caggcacgccattgtcgtcctgtggactagtcagtaactaggcttaaagclagtcgggtcggcttactatcttgaaatcc 
ggcagcgtaagctccccgtcxttaactgcctcgagatagtgacagtactctggggactttcggagatcgttatcgtta^^ 
5 gcgaatgctcggcatactaactgttgactagtcttggactagtcccgagcaaaaaggattggaggaggaggaggaaggtg 
agagtgagacaaagagcgaaataagagcttcaaaggctatctctaagcagtatgaaggttaagtatctagttcttgacta 
gatttaaagagatttcgactagttatgtacctggagtttggatataggaatgtgttgtggtaacgaaatgtaagggggag 
gaaagaaaaagtcgtcaagaggtaactctaagtcggccattcctttttgggaggcgctaaccataaacggcatggtcgac 
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